DCT-based Image/Video Compression: New Design Perspectives
نویسنده
چکیده
To push the envelope of DCT-based lossy image/video compression, this thesis is motivated to revisit design of some fundamental blocks in image/video coding, ranging from source modelling, quantization table, quantizers, to entropy coding. Firstly, to better handle the heavy tail phenomenon commonly seen in DCT coefficients, a new model dubbed transparent composite model (TCM) is developed and justified. Given a sequence of DCT coefficients, the TCM first separates the tail from the main body of the sequence, and then uses a uniform distribution to model DCT coefficients in the heavy tail, while using a parametric distribution to model DCT coefficients in the main body. The separation boundary and other distribution parameters are estimated online via maximum likelihood (ML) estimation. Efficient online algorithms are proposed for parameter estimation and their convergence is also proved. When the parametric distribution is truncated Laplacian, the resulting TCM dubbed Laplacian TCM (LPTCM) not only achieves superior modeling accuracy with low estimation complexity, but also has a good capability of nonlinear data reduction by identifying and separating a DCT coefficient in the heavy tail (referred to as an outlier) from a DCT coefficient in the main body (referred to as an inlier). This in turn opens up opportunities for it to be used in DCT-based image compression. Secondly, quantization table design is revisited for image/video coding where soft decision quantization (SDQ) is considered. Unlike conventional approaches where quantization table design is bundled with a specific encoding method, we assume optimal SDQ encoding and design a quantization table for the purpose of reconstruction. Under this assumption, we model transform coefficients across different frequencies as independently distributed random sources and apply the Shannon lower bound to approximate the rate distortion function of each source. We then show that a quantization table can be optimized in a way that the resulting distortion complies with certain behaviour, yielding the so-called optimal
منابع مشابه
Integer fast lapped transforms based on direct-lifting of DCTs for lossy-to-lossless image coding
The discrete cosine transforms (DCTs) have found wide applications in image/video compression (image coding). DCT-based lapped transforms (LTs), called fast LTs (FLTs), overcome blocking artifacts generated at low bit rate image coding by DCT while keeping fast implementation. This paper presents a realization of more effective integer FLT (IntFLT) for lossy-to-lossless image coding, which is u...
متن کاملMotion analysis in 3D DCT domain and its application to video coding
Global, constant-velocity, translational motion in an image sequence induces a characteristic energy footprint in the Fourier-transform (FT) domain; spectrum is limited to a plane with orientation defined by the direction of motion. By detecting these spectral occupancy planes, methods have been proposed to estimate such global motion. Since the discrete cosine transform (DCT) is a ubiquitous t...
متن کاملFILTER BANK DESIGN FOR IMAGE/VIDEO COMPRESSION AND DIGITAL COMMUNICATIONS by
The growing demands for delivering multimedia services over Internet and wireless networks call for high-performance and low-complexity data compression and transmission algorithms. Various such algorithms are developed in this dissertation using the filter bank theory. We first present a systematic design of multiplierless approximation of the Discrete Cosine Transform (DCT). Our method simpli...
متن کاملEnergy-based adaptive DCT/IDCT for video coding
The two-dimensional Discrete Cosine Transform (DCT) is widely used and probably the most popular method in transform-based image processing and video coding systems since it achieves very high compression ratio and packs the video signal energy into a few coefficients thereby the DCT has well-known decorrelation and energy compaction properties for typical images. The DCT is computationally int...
متن کاملAdvanced Motion Modeling for 3 D Video Coding
Driven by new multimedia applications and the growing demand for more flexible and efficient transmission of video, a new approach to video coding has been recently proposed as an alternative to classical hybrid schemes. Instead of sequential frame-based predictive processing, the new approach is based on spatio-temporal 3D transforms, open-loop nonpredictive processing, and embedded quantizati...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2014